Delayed Nondeterminism in Continuous-Time Markov Decision Processes

Authors

  • Martin R. Neuhäußer
  • Mariëlle Stoelinga
  • Joost-Pieter Katoen
Abstract

Schedulers in randomly timed games can be classified according to whether or not they use timing information. We consider continuous-time Markov decision processes (CTMDPs) and define a hierarchy of positional (P) and history-dependent (H) schedulers that induce strictly tighter bounds on quantitative properties of CTMDPs. The classification into time-abstract (TA), total-time (TT) and fully time-dependent (T) schedulers is based mainly on the kind of timing details that the schedulers may exploit. We investigate when the resolution of nondeterminism may be deferred. In particular, we show that TTP and TAP schedulers allow for delaying nondeterminism for all measures, whereas this holds neither for TP nor for any TAH scheduler. The core of our study is a transformation on CTMDPs which unifies the speed of outgoing transitions per state.

Similar resources

Efficient Computation of Time-Bounded Reachability Probabilities in Uniform Continuous-Time Markov Decision Processes

A continuous-time Markov decision process (CTMDP) is a generalization of a continuous-time Markov chain in which both probabilistic and nondeterministic choices co-exist. This paper presents an efficient algorithm to compute the maximum (or minimum) probability to reach a set of goal states within a given time bound in a uniform CTMDP, i.e., a CTMDP in which the delay time distribution per stat...

On $L_1$-weak ergodicity of nonhomogeneous continuous-time Markov processes

In the present paper we investigate the $L_1$-weak ergodicity of nonhomogeneous continuous-time Markov processes with general state spaces. We provide a necessary and sufficient condition for such processes to satisfy the $L_1$-weak ergodicity. Moreover, we apply the obtained results to establish $L_1$-weak ergodicity of quadratic stochastic processes.

A Characterization of Meaningful Schedulers for Continuous-Time Markov Decision Processes

Continuous-time Markov decision processes are an important variant of labelled transition systems having nondeterminism through labels and stochasticity through exponential fire-time distributions. Nondeterministic choices are resolved using the notion of a scheduler. In this paper we characterize the class of measurable schedulers, which is the most general one, and show how a measurable schedul...

Game Theory in Formal Verification Scribe Notes for Lecture 8 Markov Decision Processes

[Diagram: Simple Stochastic Game; labels: nondeterminism, probability] 1 Definitions. Definition 1 (Markov Chain, MC). A Markov Chain ((S, E), δ) is a graph (S, E) with a function δ : S → D(S) that maps every state to a probability distribution of successor states. There is an edge between two states s, t in S iff the probability of going from s to ...

Stochastic Nondeterminism and Effectivity Functions

This paper investigates stochastic nondeterminism by relating nondeterministic labelled Markov processes and stochastic effectivity functions to each other; the underlying state spaces are continuous. Both generalizations to labelled Markov transition systems have been proposed recently with differing intentions. It turns out that they display surprising similarities and interesting differences...

Publication date: 2009